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This article briefly describes a set of compressed, then reconstructed, test images 
submitted to the CRAF/Cassini project as part of its evaluation of near-lossless 
high-compression algorithms for representing image data. A total of seven test 
image files were provided by the project. 

The seven test images have been compressed , then reconstructed with high qual- 
ity (root-mean-square error of approximately one or two gray levels on an 8-bit gray 
scale), using discrete cosine transforms or Hadamard transforms and efficient en- 
tropy coders . The resulting compression ratios varied from about 2:1 to about 10:1 , 
depending on the activity or randomness in the source image. This was accom- 
plished without any spedai effort to optimize the quantizer or to introduce special 
postprocessing to filter the reconstruction errors. 

A more complete set of measurements , showing the relative performance of the 
compression algorithms over a wider range of compression ratios and reconstruction 
errors, shows that additional compression is possible at a small sacrifice in fidelity. 


I. Introduction 

This article briefly describes a set of compressed, then 
reconstructed, test images submitted to the CRAF/ 
Cassini project as part of its evaluation of near-lossless 
high-compression algorithms for representing image data. 
A total of seven test image files were provided by the 
project. Five test images ( d 1 , f2, h2, jl, and 12) are star 
fields from the Hubble Space Telescope, and two images 
(saturn 1 and saturn2) are views of Saturn from Voyager. 
Three of these original images are shown in Figs. 1(a), 
1(b), and 1(c). The dimensions of the Hubble images and 


the Saturn images are 256 x 256 and 800 x 800, respec- 
tively. All images are represented by 8-bit pixel values in 
the range 0 to 255. 

A total of 12 compressed/reconstructed images were re- 
turned to the project, as listed in Table 1. Three of the 
reconstructed images (marked by arrows => in Table 1) 
are shown in Figs. 2(a), 2(b), and 2(c) for comparison 
with the originals. Each of the seven test, images was 
compressed using an algorithm that produces high qual- 
ity reconstructed images (left-hand portion of Table 1) 
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with a root-mean-square error (RMSE) of about one gray 
level. Alternate compressed/reconstructed image versions 
(right-hand portion of Table 1) were also provided for five 
of the seven test images. Three of these alternate images 
show how much additional compression is possible at a 
small sacrifice in image fidelity, and the other two alter- 
nate images illustrate the effectiveness of a different com- 
pression algorithm, which is simpler to implement on the 
spacecraft. 

In Table 1, the RMSE is computed in absolute units on 
an 8-bit gray level scale. The quoted bit rate is the number 
of bits per pixel required to encode the compressed image 
before reconstruction. The compression ratio is calculated 
as 8 bits divided by the bit rate. 


II. Description of the Compression System 

The specific algorithms used to compress, then recon- 
struct the twelve images listed in Table 1 can be described 
with reference to the block diagram in Fig. 3. The vari- 
ous blocks in this diagram are described in the following 
sections. 

A. Data Transform/Inverse Data Transform 

A discrete cosine transform (DCT) was applied to ob- 
tain ten of the twelve compressed/reconstructed images. 
The DCT is near-optimal for a wide variety of images, 
and is fast becoming an industry standard for high com- 
pression. The DCT was calculated using floating point 
arithmetic and applied to 8 x 8 sub-blocks of the image. 

A Iladamard transform (IIT), also applied to 8 x 8 
blocks, was used for the remaining two compressed/ 
reconstructed images. The HT is generally not as effec- 
tive as the DCT, but it performed reasonably well for the 
seven test images. The IIT is simpler to implement than 
the DCT, because it can be computed with integer arith- 
metic and without multiplications. 

Mathematically, both transforms are defined as unitary 
transformations on each 8x8 block of data. The image 
array X is decomposed into 8x8 blocks X IJ , i.e., X — 
[X 7 * 7 ], and the array of transform coefficients T is built 
from 8x8 blocks T IJ , i.e., T = \T IJ ] . The blocks of 
transform coefficients are given by 


i CX lJ £ r for DCT 
\ HX IJ H for IIT 


where the 8x8 matrices C = [c,*j] and H — [A,*j] are 
defined by 


Cii — 


II = 


v^2cos 

( * >(2j + l) 

V 16 

<= 

1 , 2 ,: 

5, 4, 5, 6, 7 



/ 

j = 

0,1, 

2, 3, 4, 5, 6,7 

l 1 


i ~ 

= 0, j 

= 0,1,2, 3, 4 

- 1 

1 1 

1 1 

1 

1 1 “ 

1 

1 1 

1 -1 

-1 

-1 -1 

1 

I -1 - 

-1 -1 

-1 

1 1 

1 

1 -1 - 

-1 1 

1 

-1 -1 

1 - 

1 -1 

1 1 

-1 

-1 1 

1 - 

1 -1 

1 -1 

1 

1 -1 

1 - 

1 1 - 

-1 -1 

1 

-1 1 

. 1 - 

1 1 - 

-1 1 

-1 

1 -1 . 


H 1 


Both the DCT and the IIT are exactly invertible in 
principle. The inverse transform formulas are simply: 


X IJ = 


1 c t t ij c for DCT 


5 HT IJ H for HT 


Both the DCT and IIT can be implemented with fast 
algorithms requiring fewer arithmetic operations than di- 
rect implementation of the matrix multiplications in the 
above expressions. The DCT requires real multiplications 
and additions, whereas the IIT requires only integer ad- 
ditions and no multiplications. For this study, the DCT’s 
real arithmetic was approximated by 32-bit floating point 
multiplications and additions, whereas integer arithmetic 
must generally be substituted in practice. Integer approx- 
imations in the computation of the transform coefficients 
can produce additional errors in the reconstructed images. 

Transforming a block of data does not change its in- 
formation content. Useful transforms concentrate most of 
the data’s energy into a small number of transform coeffi- 
cients. Low-energy transform coefficients can be encoded 
with a small number of bits. 

B. Quantizer/Dequantizer 

The DCT produces real transform coefficients, which of 
necessity must be quantized to a finite number of bits. The 
HT produces quantized coefficients, but the quantization 
is unpractically fine: the transform coefficients have eight 
times the dynamic range and one-eighth the granularity of 
the input, requiring six extra bits to represent exactly. So 
additional quantization is also performed for the HT. 
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Once the transform coefficients have been quantized, 
the dequantizer in Fig. 3 can only reconstruct an approxi- 
mate version of the true coefficients, and the inverse data 
transform can no longer regenerate the exact original im- 
age. Except for the possible errors (noted above) in com- 
puting the transform coefficients, quantization of the com- 
puted coefficients is the only step in Fig. 3 that intro- 
duces errors in the reconstructed image data. The choice 
of quantization coarseness and uniformity thus sets the fi- 
delity of the reconstructed image. This choice also limits 
the extent to which the entropy encoder can compress the 
image. 

A uniform quantizer was used for all twelve of the com- 
pressed/reconstructed images. Mathematically, the out- 
put of the quantizer is an array of 8 x 8 blocks, Q — [Q IJ ]^ 
where each block, Q IJ = [Q{j]> is obtained from the 8x8 
block of transform coefficients, T IJ = [T^* 7 ], as 


where q is the quantizer step size and [aj is the largest 
integer less than or equal to a. This uniform quantizer is 
actually “triply uniform.” Not only are the quantization 
levels for each transform coefficient equally spaced, but the 
quantization step size q is the same for all 64 coefficients 
within each 8x8 block, and the step size does not change 
from block to block. A step size of q = 4 was used for nine 
of the twelve images, and a coarser step size of q — 8 was 
used for three of the alternate images. 

Nonuniform quantization rules are available [1,2] to 
match the quantizer to the human visual response by 
selectively quantizing low-frequency DCT or IIT coeffi- 
cients more finely than high-frequency coefficients. Simi- 
larly, other algorithms can adapt the quantizer to the local 
statistics of the data on a block by block basis. However, 
such quantizers have so many adjustable parameters that a 
nonuniform quantizer optimized for a small set of test im- 
ages would not fairly reflect the performance of the quan- 
tizer for untested images. Research is ongoing to find uni- 
versal nonuniform, adaptive quantizers that consistently 
outperform the uniform quantizer. 

C. Entropy Encoder/Entropy Decoder 

The entropy encoder losslessly encodes the array of 
quantized transform coefficients Q into a bit stream b 
with bit rate approaching the entropy per coefficient of 
Q. Several types of highly efficient coders are available for 
this purpose. Among these are the Gal lager-van- Voorhis- 
Huffman (GVII) coder [3], a variant of IBM’s arithmetic 


Q-coder [4] being developed for the Joint Photographic Ex- 
perts Group (JPEG) standard [2], and a baseline Huffman 
coder for this same standard. Variations of these coders 
were used to compress, then reconstruct the images listed 
in Table 1. 

The first step in all three coding schemes is to ar- 
range the quantized transform coefficients Q IJ — \Q\j\ 
into an ordered sequence, starting with the DC coefficient 
(z rr j = 0) in the upper left-hand corner of the trans- 
formed block. The remaining 63 coefficients (AC coeffi- 
cients) are ordered in some fashion, generally via a zigzag 
readout starting at the upper left-hand corner and working 
toward the lower right-hand corner. This zigzag sequence 
arranges the AC coefficients in increasing order of spatial 
frequency. 

The GVH coding scheme is derived based on two obser- 
vations on the quantized AC and DC coefficients. First, 
the AC coefficients and the differences between adjacent 
DC coefficients have two-sided geometric distributions. 
Second, runs of zeros occur frequently in the zigzag se- 
quences of AC coefficients, especially at high compression 
ratios. By extending a result originally shown by Gallager 
and van Voorhis, a near-optimal adaptive coding scheme 
for prefix coding the two-sided geometric source is derived, 
avoiding both binning calculations and the Huffman tree 
generation algorithm. Instead, this scheme estimates the 
local activity of each 8x8 block by counting the num- 
ber of zeros in the block or in some preceding blocks, and 
adaptively encodes the transform coefficients using simple 
pipelined table lookup operations. An optional runlength 
code can also be used to encode runs of zeros in the zigzag 
sequence of AC coefficients. 

The Q-coder is a lossless, binary entropy coder, devel- 
oped by researchers at IBM, that efficiently implements an 
Elias code [5] on an input bit sequence. A coarsely quan- 
tized approximation to the real interval [0,1], or a scaled 
version thereof, is recursively subdivided into two sections, 
whose sizes are proportional to probability estimates that 
the bit currently being coded is a 0 or I. By dynamically 
updating these estimates using a finite-state machine, the 
Q-coder adapts to input data statistics (unlike a Huffman 
encoder, which requires statistics before coding), which 
makes it both robust and efficient. A coding model forms 
a binary sequence from a raster-scan ordering of the 64 
quantized DCT or HT coefficients in each block. For each 
integer, an equal-to-zero flag bit, the sign bit, the position 
of the most significant bit, and then the least significant 
bits are sent through the Q-coder. Runlength coding was 
not performed because the small quantizer step sizes used 
make it unprofitable. A simple model, using only 12 prob- 
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ability estimates, each of which is a 5-bit number (state), 
was used instead of the complex model in Section 8 of [2] 
that requires 252 estimates. Negligible bit rate reductions 
are expected with the latter model for the images tested. 
The output sequence length is very close to a value calcu- 
lated from the input stream entropy. Since only additions, 
subtractions, and comparisons are utilized by a Q-coder, 
it is simple and fast in practice. The particular variant 
implemented is described in Section 12 of [2]. 

The Joint Photographic Experts Group of the Inter- 
national Standards Organization/International Telegraph 
and Telephone Consultative Committee (ISO/CCITT) [2] 
is currently developing an international standard for still- 
image compression. In its baseline version, the proposed 
algorithm consists of an 8 x 8 DCT, coefficient quantiza- 
tion, and Huffman or arithmetic coding. This scheme pro- 
vides a near-lossless, high-compression image coding capa- 
bility, which preserves image fidelity at compression rates 
competitive or superior to most known techniques. The 
DCT’s 64 coefficients are independently uniformly quan- 
tized with a different step size for each coefficient. The 
DC component is differentially encoded, and the AC com- 
ponents are runlength encoded. Finally, some of the most 
significant bits of each resulting code are further encoded 
with a variable length code; the remaining bits are trans- 
mitted as they are. In the case of Huffman encoding, the 
JPEG default tables were used. The tables for the Huff- 
man codes can be easily customized to adapt to the par- 
ticular image source of interest. 

The bit rates listed in Table 1 are the bit rates achieved 
by a variant of the Q-coder. The GVH and JPEG coders 
achieve comparable bit rates averaging 0.2 to 0.3 bits per 
pixel higher than the Q-coder for the images in Table 1. 

D. Noiseless Channel 

Because the entropy code is lossless, the entropy de- 
coder is able to reconstruct an exact replica of the quan- 
tized transform coefficients, given the compressed files of 
coded bits. However, if the channel in Fig. 3 were not 
noiseless, the decoding process would be severely disrupted 
and errors might propagate wildly. 

E. Preprocessing and Postprocessing (Not 
Implemented) 

The original image data supplied by the project were 
actually obtained by preprocessing 12-bit data available 
from the cameras. The 12-bit data were subjected to a 
square-root operation, then quantized to 8 bits. No addi- 
tional preprocessing was performed on the 8-bit data be- 
fore the transform operation depicted in Fig. 3. The 8-bit 


data provided by the project were considered to be the 
original image data for the purposes of the tests reported 
here. 

Some sort of postprocessing is often desirable, follow- 
ing the inverse data transform, to make the reconstruc- 
tion errors less noticeable. Postprocessing can be ap- 
plied to remove visually disturbing blockiness in images 
that have been highly compressed by block-transform tech- 
niques. No such postprocessing was performed for any of 
the twelve compressed/reconstructed images, because it 
w’ould unfairly mask the true efficacy of the compression 
algorithms. 

A crude form of nonoptimum postprocessing actually 
did take place after the inverse transform, because the out- 
put values had to be quantized to 8 bits to fit the original 
image format. This quantization step should be skipped or 
deferred if the output data are subjected to further post- 
processing (such as removal of the square-root operation 
mentioned above). 

III. Additional Performance Results 

The twelve reconstructed images listed in Table 1, in- 
cluding the three images shown in Figs. 2(a), 2(b), and 
2(c), were chosen to reflect the desires of the project to ob- 
tain compression ratios in the range of about 2:1 to about. 
10:1 with essentially zero reconstruction error. A more 
complete set of measurements showing the relative perfor- 
mance of the compression algorithms over a wider range 
of compression ratios and reconstruction errors was also 
obtained. These results are plotted in Figs. 4(a), 4(b), 
4(c), 5(a), 5(b), and 5(c) for the three images shown in 
Figs. 1(a), 1(b), and 1(c), using DCT-based algorithms. 
Figures 4(a), 4(b), and 4(c) show the bit rate (bits per 
pixel) of the compressed images as a function of the RMSE 
distortion, and Figs. 5(a), 5(b), and 5(c) show the cor- 
responding compression ratios. In these figures (he bit 
rates and compression ratios achieved by the three entropy 
coders are compared with each other and with an approx- 
imate bound based on the estimated entropy of the quan- 
tized DCT coefficients. This entropy "‘bound” is derived 
assuming stationary statistics throughout the image; it can 
sometimes be beaten by algorithms capable of adapting to 
locally varying statistics. 

IV. Summary 

The seven test images have been compressed, then re- 
constructed with high quality (RMSE of approximately 
one or two gray levels on an 8-bit gray scale) using DCT- 
or IIT-based schemes and efficient entropy coders. The re- 
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suiting compression ratios varied from about 2:1 to about 
10:1, depending on the activity or randomness in the 
source image. This was accomplished without making any 
special effort to optimize the quantizer or to introduce spe- 
cial postprocessing to filter the reconstruction errors. 


A more complete set of measurements, showing the rel- 
ative performance of the compression algorithms over a 
wider range of compression ratios and reconstruction er- 
rors, shows that additional compression is possible at a 
small sacrifice in fidelity. 
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Table 1. List of original and compresaed/reconstructed teat images. Arrows =► denote the three Images 

shown In Figs. 2(a), 2(b), and 2(c). 


C ompressed /Reconstructed 
images* 


Alternate compressed/ 
reconstructed images 13 


Original 


images 

RMSE, 

absolute 

Bit rate, 
bits/pixel 

Compression 

ratio 

RMSE, 

absolute 

Bit rate, 
bits/pixel 

Compression 

ratio 

dl 

1.19 

1.50 

5.33 

— 

— 

— 

f2 

1.15 

1.15 

6.96 

— 

— 

— 

h2 

1.16 

1.21 

6.61 

(H) 1.18 

1.25 

6.40 

jl 

1.19 

3.77 

2.12 

(D) 2.33 

2.56 

3.12 

12 

1.19 

2.04 

3.92 

=* (D) 2.25 

1.10 

7.27 

saturnl 

0.97 

1.31 

6.11 

=> (D) 1.43 

0.75 

10.67 

saturn2 

=> 0.87 

0.82 

9.76 

(H) 0.95 

1.00 

8.00 


* Using Q-coder and DOT with quantization step size q = 4. 

b Using Q-coder and either DCT with quantization step size q = 8 (D) or HT with step size q = 4 (H). 
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Fig. 1. Original test Images tor CRAF/Casslnl: 
(a) Hubble image “12," (b) Saturn image 

"saturnl," and (c) Saturn image M saturn2.'* 


<a> 

m 



Fig. 2. Reconstructed test Images for CRAF/ 
Cassini: (a) Hubble image "12, " (b) Saturn image 
"saturnl,” and (c) Saturn image “saturn2.” 
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Fig. 3. Block diagram of compression system. 
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Fig. 4. Bit rate versus RMSE distortion for: (a) Hubble image "12, ” (b) Saturn image "saturnl," and (c) Saturn image "saturn2. 






Fig. 5. Compression ratio versus RMSE distortion for: (a) Hubble image "12,” (b) Saturn image “saturnl,” and (c) Saturn 

image “saturn2." 






